Stochastic control optimal in the Kullback sense
نویسندگان
چکیده
The paper solves the problem of minimization of the Kullback divergence between a partially known and a completely known probability distribution. It considers two probability distributions of a random vector (u1, x1, . . . , uT , xT ) on a sample space of 2T dimensions. One of the distributions is known, the other is known only partially. Namely, only the conditional probability distributions of xτ given u1, x1, . . . , uτ−1, xτ−1, uτ are known for τ = 1, . . . , T . Our objective is to determine the remaining conditional probability distributions of uτ given u1, x1, . . . , uτ−1, xτ−1 such that the Kullback divergence of the partially known distribution with respect to the completely known distribution is minimal. Explicit solution of this problem has been found previously for Markovian systems in Karný [6]. The general solution is given in this paper.
منابع مشابه
From information theoretic dualities to Path Integral and Kullback Leibler control: Continuous and Discrete Time formulations
This paper presents a unified view of stochastic optimal control theory as developed within the machine learning and control theory communities. In particular we show the mathematical connection between recent work on Path Integral (PI) and Kullback Leibler (KL) divergence stochastic optimal control theory with earlier work on risk sensitivity and the fundamental dualities between free energy a...
متن کاملApplication of Stochastic Optimal Control, Game Theory and Information Fusion for Cyber Defense Modelling
The present paper addresses an effective cyber defense model by applying information fusion based game theoretical approaches. In the present paper, we are trying to improve previous models by applying stochastic optimal control and robust optimization techniques. Jump processes are applied to model different and complex situations in cyber games. Applying jump processes we propose some m...
متن کاملAn Application of the Stochastic Optimal Control Algorithm (OPTCON) to the Public Sector Economy of Iran
In this paper we first describe the stochastic optimal control algorithm called ((OPTCON)). The algorithm minimizes an intertemporal objective loss function subject to a nonlinear dynamic system in order to achieve optimal value of control (or instrument) variables. Second as an application, we implemented the algorithm by the statistical programming system ((GAUSS)) to determine the optimal fi...
متن کاملOptimal Stochastic Control in Continuous Time with Wiener Processes: General Results and Applications to Optimal Wildlife Management
We present a stochastic optimal control approach to wildlife management. The objective value is the present value of hunting and meat, reduced by the present value of the costs of plant damages and traffic accidents caused by the wildlife population. First, general optimal control functions and value functions are derived. Then, numerically specified optimal control functions and value func...
متن کاملNumerical Solution of Optimal Heating of Temperature Field in Uncertain Environment Modelled by the use of Boundary Control
In the present paper, optimal heating of temperature field which is modelled as a boundary optimal control problem, is investigated in the uncertain environments and then it is solved numerically. In physical modelling, a partial differential equation with stochastic input and stochastic parameter are applied as the constraint of the optimal control problem. Controls are implemented ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Kybernetika
دوره 44 شماره
صفحات -
تاریخ انتشار 2008